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Abstract. We generalize classical results on the gap distribution (and other fine- 
scale statistics) for the one-dimensional Farey sequence to arbitrary dimension. This 
is achieved by exploiting the equidistribution of horospheres in the space of lattices, 
and the equidistribution of Farey points in a certain subspace of the space of lattices. 
The argument follows closely the general approach developed by A. Strombergsson 
and the author [Annals of Math. 172 (2010) 1949-2033]. 



Denote by Z ra+1 the set of integer vectors in IR™ +1 with relatively prime coefficients, 
i.e., Z n+1 = {m e Z n+l \ {0} : gcd(m) = 1}. The Farey points of level Q G N are 
defined as the finite set 

(1) Jq= j^G [0,1)": (p,g)GZ" +1 , 0<g<Q 

The number of Farey points of level Q is asymptotically, for large Q, 

Q n+1 



( 2 ) \Fq\ ~ a Q :z 



(n + l)((n + iy 



In fact, for any bounded set V C [0, l) n with boundary of Lebesgue measure zero and 
non-empty interior, 

(3) \F Q n V\ ~ vol(P) a Q , 

which means the Farey sequence is uniformly distributed in [0, l) n . 

The objective of the present paper is to understand the fine-scale statistical properties 
of J-q. To this end, it will be convenient to identify [0, l) n with the unit torus T n = 
M n /Z n via the bijection [0, l) n — > T n , x i— > x + Z™. We will consider the following 
two classical statistical measures of randomness of a deterministic point process: Given 
k G Z> and two test sets V C T n and A C W 1 , both bounded, with boundary of 
Lebesgue measure zero and non-empty interior, define 

vol{x 6 V : |(s + a Q l/n A + Z») n T Q \ = k] 

(4) P Q (k,V,A) = ^ 

and 

\{t G J-q Pi T> : \(r + a Q 1/n A + Z n )n F Q \ = k}\ 



(5) P , Q (k,V,A) 
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The scaling of the test set A by a factor (Jq^ 1 ensures that the expectation value 

oo 

(6) EP Q (k,V,A) :=Y,kP Q (k,V,A) 



k=0 



is asymptotic to vol (.4.) for large Q. The quantity Po,q(0, V, A) is the natural higher 
dimensional generalization of the gap distribution of sequences in one dimension, which, 
the case of the Farey sequence for A = [0, s] and V — T, was calculated by Hall 
[6]. Pq(0,T, [0,s]) corresponds in one dimension to the probability that the distance 
between a random point on T and the nearest element of the sequence is at least s/2. 
An elementary argument shows that in one dimension the density of this distribution 



is equal to P ,q(0,T, [0,s]), see e.g. [HI Theorem 2.2] and eq. (36) below. The most 
comprehensive result in one dimension is due to Boca and Zaharescu |3j , who calculate 
the limiting n-point correlation measures. We refer the reader to the survey article jl] 
for an overview of the relevant literature. 

The results we will discuss here are valid in arbitrary dimension, and will also extend 
to the distribution in several test sets Ai, . . . , A s . To keep the notation simple, we will 
restrict the discussion to one test set; the proofs are otherwise identical, cf. [151 Section 
6] for the necessary tools. 

It is evident that the distribution of Farey sequences is intimately linked to the dis- 
tribution of directions of visible lattice points studied in [151 Section 2]. The only 
difference is in the ordering of the sequence of primitive lattice points and the way they 
are projected: In the Farey case we take all primitive lattice points in a blow-up of the 
polytope {(x,y) G (0, l] n+1 : xj < y}, draw a line from each lattice point to the origin 
and and record the intersection of these lines with the hyperplane {(x, 1) : x G W 1 }. In 
the case of directions, we take all points in a fixed cone with arbitrary cross-section pro- 
jected radially onto the unit sphere. Since the cross section of the cone is arbitrary, this 
yields (by a standard approximation argument) the statistics of primitive lattice points 
in the blow-up of any star-shaped domain (with boundary of measure zero), which are 
projected radially onto a suitably chosen hypersurface of codimension one. The proof 
of a limit distribution for Pq[k,T>, A) for Farey fractions is therefore a corollary of the 
results of [15] . 

If the points in Tq were independent, uniformly distributed random variables in T", 
we would have, almost surely, convergence to the Poisson distribution: 

(7) lim P Q (k,V,A) = lim P 0tQ (k,V,A) = ^M^ e -^(A) ag 

The Tq are of course not Poisson distributed. But, as we will see, the limit distributions 
exist, are independent of P, and are given by probability measures on certain spaces of 
random lattices in M n+1 . The reason for this is as follows. 
Define the matrices 

and the cone 

(9) €(A) = {(x, y)eR n x (0, 1] : x G a~ 1/n yA} C E n+1 . 
Then, for any (p,q) G K n+1 , 

(10) - G x + a 1/n A, 0<q<Q, 

q v 



vo\{x £ V : 


z 


n+ 


1 /i(^)a(g) nc(A)\ = k} 




vol(D) 
Z n+1 /i(r)a 


(Q) n €(A)\ = k}\ 




F Q nv\ 
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if and only if 

(11) (p,q)h(x)a(Q)e€(A). 
Thus, if Q is sufficiently large so that a~^^ n A C (0, l] n , then 

(12) \{x + cx Q 1/n A + Z n ) n F Q \ = \Z n+1 h{x)a{Q)n<£{A)\. 

This observation reduces the question of the distribution of the Farey sequence to a 
problem in the geometry of numbers. In particular, Q and ^ can now be expressed 
as 

(13) P Q (k,V,A) 
and 

(14) P 0>Q {k,V,A) = 

Let G = SL(n+l,R) and T = SL(n+l, Z). The quotient T\G can be identified with the 
space of lattices in IR ra+1 of covolume one. We denote by // the unique right G-invariant 
probability measure on T\G. Let furthermore be /zo the right Go-invariant probability 
measure on r \G , with G = SL(n,R) and r = SL(ra, Z). 
Define the subgroups 

(15) H = |m £ G: (0,1)M = (0,1) J = j ^j:Ag G , b £ R n | 
and 

(16) rv = rn#={(^ ^ : 7 e r , m e z n J. 

Note that if and T# are isomorphic to ASL(n, R) and ASL(n, Z), respectively. We 
normalize the Haar measure //# of H so that it becomes a probability measure on 
Tjj\H. That is, 

(17) d/i H (M) = dn (A) db, M = (q i 

The main ingredient in the proofs of the limit theorems for P 0i q(&, P, *4) and -Pq(&, V, A) 
are the following two equidistribution theorems. The first is the classic equidistribution 
theorem for closed horospheres of large volume (cf. [Tj)J Section 5] for background and 
references), the second the equidistribution of Farey points on closed horospheres [121 
Theorem 6]. In the latter, a key observation is that [121 ec L- (3.53)] 

(18) Th(r)a(Q) £ T\THa(2 ) ~ r H \Ha{%). 
Theorem 1. For f : T n x T\G — > R bounded continuous, 

(19) lim / f(x,h(x)a(Q))dx= [ f{x,M)dxd fJl {M). 
Theorem 2. For / : T n x r\G — >■ R bounded continuous, 



(20) 



lim V) f(r,h(r)a(Q)) = [ [ f(x,Ma{\~^))dxd^ H {M)d\. 
K 3 1 r ^ Q JT»xr H \H 
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Both theorems can be derived from the mixing property of the action of the diagonal 
subgroup {a(y)} y£ s. >0 . The exponential decay of correlations of this action was exploited 
by H. Li to calculate explicit rates of convergence [9]. One can furthermore generalize 
Theorem [2] to general lattices r in G and non-closed horospheres [13] . Theorem [2] may 
also be interpreted has an equidistribution theorem for periodic points of the return 
map of the horocycle flow (in the case n = 1) to the section 

(21) r\TH{a(y) : y G R^} ~ T H \H{a(y) : y G R yl } 



which is discussed in pQ. The identification of (21) as an embedded submanifold, which 
is transversal to closed horospheres of large volume, is central to the proof of Theorem 

|2|in ng. 

By standard probabilistic arguments, the statements of both theorems remain valid 
if / is a characteristic function of a subset 5cT"x T\G whose boundary has measure 
zero with respect to the limit measure dxdfi(M) or dx cf/i#(M) dX, respectively. The 
relevant set in our application is 

(22) S = V x {M G r\G : \Z n+1 M n €(A)\ > k}. 

The fact that S has indeed boundary of measure zero with respect to dx dfi(M) is 
proved in [151 Section 6]. We can therefore conclude: 

Theorem 3. Let k G Z>o, and T> C T n , A C lR n bounded with boundary of Lebesgue 
measure zero. Then 

(23) lim P Q (k, V, A) = P(k, A) 

Q— >co 

with 

(24) P(k,A) =fi{{MeT\G: \Z n+1 Mn€(A)\ — k}), 
which is independent of the choice ofT>. 

In the second case, we require that the set 

(25) {M eT H \H :\% n+l Mr\<l x (A)\>k}, £ X {A) := €.(A)a{X&), 

has boundary of measure zero with respect to which follows from analogous argu- 
ments. With this, we have: 

Theorem 4. Let k G Z>o, and T> C T n , A C M n bounded with boundary of Lebesgue 
measure zero. Then 



(26) 



lim P 0tQ {k,V,A) = P (k,A) = f p Q (k,€ x (A))dX. 



where 

(27) p (k,£) = fi H ({M G T H \H : |Z"' +1 Mn C)| = k}), 
which is independent of the choice ofT>. 

In dimension n > 2, it is difficult to obtain a more explicit description of the limit 
distributions P(k,A) and P (k,A). It is however possible to provide asymptotic esti- 
mates for large and small sets A when k = and k — 1, see [2j [18] for general results 
in this direction. The case of fixed A and large k is discussed in [TO] . 

The geometry of Y\G is significantly simpler in the case n — 1. This permits the 
derivation of explicit formulas for the limit distributions in many instances, cf. 
For example, take A = (0, s), and the cone £\(A) becomes the triangle 

(28) A SjA = {(x u x 2 ) G R 2 : < x x < fx 2 Xs, < x 2 < A _1/2 }, 
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where we have used o\ = — Furthermore Th\H is simply the circle T = R/Z, 
Hh is the standard Lebesgue measure. Hence 

(29) p (k, A a>A ) = meas({s G T : \{(p,q) G Z 2 : {p,px + q) G A S , A }| = fc}). 

It is now a geometric exercise to work out the case k = 0: With the shorthand y = A 1 / 2 
and a = (^s) -1 , we deduce 



(30) p (0,A s , x )= + ± iia<y<a(l-y)- 1 



if y < a 
if a < y 
V > a(l -y)' 



Solving for y, we have in the case < a < \ 

if y G [0, a] 

(31) p (0,A SiA )=<|l-J + ^ if yG [ a ,I-yr_^]u[| + t/j^M] 



For | < a < 1, we have 

(32) Po(0,A s , A ) 



if ^ G [I" \/4 a 4 + \/4 fl 



if y G [0, a] 
J + ^ ifyG[a,l], 



and for a > 1, we have 

(33) po(0,A,, A ) = l, ye [0,1]. 



The gap distribution Po(0, [0, s]) is now an elementary integral (recall (26)), which yields 

1 if a G [1, oo) 

(34) P (0, [Q,a]) = <j -l + 2o-2al ogo if a G [|, 1] 

-l + 2a + 2^/| - a- 4alog(| + ^J\-a) if a G [0, |]. 

which reproduces Hall's distribution [6J. The density of this distribution is 

(35) 

{0 ifaG[l,oo) 
-*§Vbga ifae[|,l] 
-^a 2 log(i + J\^a) if a e [0,1], 

cf. jH Theorem 2.1]. By [TTJ Theorem 2.2], we have 

(36) -^P(0,[0, S ]) = P (0,M), 



and hence formula (34) yields directly the density of the distribution of the distance to 
the nearest element. Formula (34) was rediscovered in [HI Lemma 2.6]. 

Theorems [T] and [2] reduce in the case n = 1 to classic statements in the theory of 
automorphic forms, with precise bounds on the rate of convergence. Sarnak [TH] proved 
Theorem [I] for test functions / G CJf (infinitely differentiate, compactly supported) 
that are independent of the first coordinate x, and showed that the optimal rate of 
convergence holds if and only if the Riemann Hypothesis is true (this phenomenon was 
first pointed out by Zagier [20]). The reason for the appearance of the Riemann zeros is 
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that the only relevant harmonics in the problem are Eisenstein series E2k(z,s) of even 
weight 2k, whose poles are located at the poles of 



(37) E 



<p(q) ((2s - 1) 



; 7 2 - C(2a) 

where <p(s) is Euler's totient function and £(s) the Riemann zeta function. 

Under the Riemann Hypothesis, Sarnak's rate is significantly better than what one 
would expected from square-root cancellations — it is the square-root of that. If the 
test function / depends on x (we assume again / is C^°), the work of Hejhal [7] and 
Strombergsson [T7] shows that the convergence rate slows to the square-root of the 
horocycle length (or worse) as other terms in the harmonics dominate the error coming 
from of the Riemann zeros. The object replacing the Eisenstein series in this setting is 
the Poincare series P m ,2k{ z , s) of weight 2k. 

The proof Theorem[2]for n = 1 on the other hand quickly reduces to estimates of sums 
over Kloosterman sums. To see this, note first of all that the statement of Theorem [2] 
is equivalent to: For every bounded continuous function / : T x r\H — > R (where EI is 
the complex upper half plane, on which T = SL(2,Z) acts by Mobius transformations) 
we have 

1 NT> V- Jp V .Q 2 \ f°° f 1 f\, \ j j dv 



v 2 



*» ^ e e 4 * + f ) = n i ,{x ' - + iv) dx du 

Here denotes the multiplicative group of invertible residues mod q, and p is the in- 



verse of p mod q. One way of proving (38) is to expand / £ in its harmonics (Fourier 
series in x and y and Mellin transform in A) which leads to Selberg's Kloosterman zeta 
function 



( 39 ) Z m 1 , m2 (s) = Y^ 

9=1 

with the Kloosterman sum 

(40) K(m 1 ,m 2 ,q) = ^ 



K(m 1 ,m 2 ,q) 



q 2s 



e 27ri(mip+m 2 p)/<? 



As in the case of the equidistribution of closed horocycles, where the asymptotics was 
determined by the poles of the Eisenstein and Poincare series, the poles of Z mi>rn2 (s) 



now determine the asymptotics of (38). Note that Z 0jrri2 (s) are precisely the Fourier 
coefficients of E (z,s) and, as already understood by Selberg, Z mi ^ m2 (s) is the m 2 - 
th Fourier coefficient of the Poincare series P mii o(z, s). Hence the appearance of the 
Riemann hypothesis in the error term of Theorem [2] mirrors exactly the situation in 
Theorem [H 
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